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ABSTRACT 

Recommendation is usually reduced to a prediction problem 
over the function r{ua,ei) that returns the expected rating 
of element e,; for user Ua- In the IPTV domain, we deal with 
an environment where the definitions of all the parameters 
involved in this function (i.e., user profiles, feedback ratings 
and elements) are controversial. To our knowledge, this pa- 
per represents the first attempt to run collaborative filtering 
algorithms without inner assumptions: we start our analysis 
from an unstructured set of recordings, before performing a 
data pre-processing phase in order to extract useful infor- 
mation. Ifence, we experiment with a real Digital Video 
Recorder system where EPG have not been provided to the 
user for selecting event timings and where explicit feedbacks 
were not collected. 

Categories and Subject Descriptors 

H. 3.3 [Information Storage and Retrieval]: Information 
Search and Retrieval — Information filtering; J. 4 [Computer 
Applications]: Social And Behavioral Sciences 

General Terms 

Algorithms, Experimentation, Human Factors. 

Keywords 

Digital Video Recorders, TV Broadcasts, Recommendation 
Systems, Collaborative Algorithms, Implicit Data 

I. INTRODUCTION AND CONTEXT 

In the wide context of IPTV services. Digital Video Recorders 
(DVR) (a.k.a. Personal Video Recorder (PVR)), are hard- 
ware or software devices that record digital video to a mem- 
ory medium, for a further access (e.g., stand-alone set-top- 
boxes (STB), portable media players, and PC based DVRs). 
One of the most important exploitations of the DVRs is 
the media sharing: users may want to access to all their 
personal media resources (e.g., pictures, podcasts, TV pro- 
grams) from any of their own devices (e.g., laptops, smart- 



phones and so on). For example, OrtO is a freeware stream- 
ing software that enables users to access remotely their me- 
dia via Internet. In this paper, while focusing on the record- 
ing of radio and TV broadcasts, we keep in mind the general 
media sharing domain. 

Recording and personalization is changing the way providers 
insert advertisements during content delivery. The emer- 
gence of new behaviors and content consuming trends is forc- 
ing advertisers to look for new ways (e.g., a pull model) for 
spreading their commercials, trying to avoid the fast-forward 
of pre-recorded videos to skip commercials. In this domain, 
thus, recommender systems pj are becoming worthwhile as 
retention tools for customers. When a user is satisfied with 
suggestions, she dedicates more attention to the proposed 
links and references. Even if some recommendation engine 
has been tailored for DVRs (e.g., Neptuny's Content Wise, 
ReignSoft's Impress), a comprehensive analysis of the pecu- 
liarities of this domain is still missing. 

First of all, usage data collection is subject to serious pri- 
vacy concerns. Users are not willing to loose control of data 
they produce while taking advantage of recommendations 
and personalized information. Moreover, when a shared de- 
vice is used (e.g., the television) also family control issues 
arise, and it can be difficult to provide personalized informa- 
tion. Even for such reasons, costumers protect themselves 
behind fictitious on line identities, and this is an important 
challenge for many recommendation algorithms, that need 
user profiles to increase their accuracy. 

Another important problem in the IPTV domain ([5l|6]) is 
that, differently from other domains where recommenders 
can learn from explicit user ratings over content, in DVR 
systems we need to operate according to implicit feedbacks, 
such as recordings of a given event, downloading of pre- 
recorded videos, or - when possible - monitoring if the video 
has been effectively watched by the user. 
A third relevant problem, underestimated by previous works 
on this subject, is the difficulty of discriminating items. 
DVRs usually provide Electronic Program Guides (EPGs) 
to help users. Unfortunately, in a wider context, EPGs are 
not reliable to identify recommendable elements: TV chan- 
nels may not respect schedules, media and podcasts avail- 
able on the Web do not use a common schedule's format, 
users may be interested only in single parts of an event, and 
thus they will set up their own timings over a given chan- 
nel. In opposition with VoD, content description is difficult 
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in the TV domain. Broadcasts are announced, but often 
lack of structured meta-information: thus, collaborative fil- 
tering is usually preferred to content-based recommendation 
algorithms, even if its execution is not straightforward. 

Recommendation is usually reduced to a prediction problem 
over the function r{ua,ei) that returns the expected rating 
of element d for user Ua- As observed above, we are dealing 
with an environment where the definitions of all the param- 
eters involved in this function (i.e., user profiles, feedback 
ratings and elements) are controversial. To our knowledge, 
this is the first attempt to run collaborative filtering algo- 
rithms without inner assumptions. 

In this paper, we start our analysis from an unstructured set 
of recordings, before performing a data pre-processing phase 
in order to extract useful information. Ifence, in Section [2] 
we present a real system without EPGs where explicit feed- 
backs are not collected. Then, we describe the procedure 
for extracting meaningful information from the large and 
unstructured amount of provided data (Section [S]) and the 
analyzed recommendation algorithms (Section |4]| . Finally, 
the evaluation of the chosen algorithms is presented in Sec- 
tion O before drawing conclusions. 

2. THE FAUCET PVR ENVIRONMENT 

Our analysis is based on real data generated by the Faucet 
PVR system, integrated in a web-based podcasting service 
named FCasfl Faucet allows users to record their favorite 
(Italian) TV and Radio programs, and to further download 
them into their devices (e.g., iPod, PC, notebook) t4j. The 
user can set up her own programming and see or download 
her recordings by the use of a simple web interface. Bring- 
ing the ability to record and group into a single feed public 
and private channels (such as radio and TV recorded pro- 
grams). Faucet PVR offers a single framework for creating 
and aggregating personal podcast compilations. 

The Faucet PVR produces a very rich and dynamic datasefl 
populated by real users expressing their preferences through 
the recorded programs. Such a context, however, is char- 
acterized by a number of constraints which we had to deal 
with, in order to be able to perform the analysis on the 
recommendation algorithms. In particular, the intrinsic dy- 
namism and variability of the recordings, as well as the 
lack of any permanent event, require that a series of pre- 
processing steps have to be undertaken prior to be able to 
apply any recommender. 

A noticeable property characterizing the context in which 
we operate is the lack of a well defined programming for 
recorded contents. Despite several EPG sources do exist, 
we can not consider them reliable enough to be used for ex- 
tracting the input information of the recommender engine. 
Therefore, we decided to opt for a different approach. Ex- 
ploiting a bottom up approach, we rely on users' knowledge 
to define the most relevant properties of the events trans- 
mitted on the major TVs and radios. 

More precisely, the task of defining the parameters related 
^http : //www, vcast . it/"] 
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to every recorded transmission is therefore demanded to sin- 
gle users. Since it is their primary interest to make sure that 
the information inserted in the Faucet PVR are as much pre- 
cise as possible, we can consider such data reliable enough 
to be used in the recommendation process. Furthermore, a 
number of inferences can be deduced from the user activity, 
and considering also the good popularity of the system, also 
numerical processing is statistically reliable. 

The Faucet PVR involves three different steps to be taken 
by an user when she is interested in recording an event: the 
parameters setting, the execution and the downloading of 
the recorded item. All three steps are performed in different 
moments, in the aforementioned order. In the parameters 
setting step, the user chooses a channel, periodicity, name, 
starting and ending times. This step must be done before the 
beginning of the program. The execution phase starts at the 
starting time and finishes at ending time. The downloading 
step is available only after the recording finishes. 

As mentioned above, an event can be periodic: if the user 
wishes, the system records the desired event in regular in- 
tervals. These intervals can be of a week or a day, and in the 
case of a daily event, the user can choose to skip weekends. 
Events classified as non-periodic have absolute starting and 
ending times. However, in the case of periodic events, start- 
ing and ending do not represent absolute times, but rather 
a weekday and daytime (in the case of weekly events) or 
just a daytime (in the case of daily events). Also, in the 
case of periodic events, there is one parameter setting step, 
but the execution step can occur an undefined number of 
times. After each execution step, a download referring to it 
is made available. The system limits the number of accu- 
mulated recordings to 3 in order to save resources (only the 
last 3 executions are available to download). 

The fact that the dataset includes information about period- 
icity implies some issues in properly determining the events 
broadcasted on TV and radio from the amount of recordings 
made by users. On the other side, it decreases the complex- 
ity of calculating recommendations, resulting in an overall 
improvement in their novelty. In fact, without taking into 
account the periodicity, as in [9], the recommender has to 
explicitly ignore periodic elements recently seen by the user, 
in order to provide a more valuable and accurate recom- 
mendation. In our domain, as the periodicity is an intrinsic 
feature of the recommendable items, we do not have such a 
constraint, being these elements automatically excluded. 

The goal of a recommendation system in the PVR context 
is to suggest a personalized set of transmissions to the users. 
However, data coming from the Faucet PVR are not immedi- 
ately usable to identify events such as the transmissions, but 
assume the form of unstructured information, which have to 
be properly processed. In particular, let T be the set of 
transmissions during a day and ti be a specific transmission 
broadcasted on channel Ct . , starting at time bt^ and ending 
at time et . . Then, ti can be directly used in the recommen- 
dation engine, as well as Wt £ T. On the contrary, data 
collected by the Faucet system (i.e., users' recordings) differ 
from ti in the sense that they define a set R of several events 
ri with a temporal validity, each referring to a specific event. 
However, recordings with different timings may refer to the 



same broadcast: given the pair r^, rj £ R, they may refer to 
the same transmission even if ri 7^ r j . Clearly, this property 
does not hold with discrete and well defined events such as 
the transmissions. 

As well as we can not exploit any EPG to identify the 
recorded contents, we can not even rely on any information 
about the specific content of each recording. Indeed, the 
Faucet PVR does not provide any reference to the type of 
transmission recorded (e.g., sport program, news, or comedy- 
movie), nor we can rely on information inserted by users in 
title field, as the insertion of titles and annotations is com- 
pletely free, and this results in very diversified and, possibly, 
incorrect descriptions. 

A further implication due to the lack of information about 
the content of the recordings is the impossibility of applying 
content based recommenders, which focus on the descrip- 
tion of item and user profiles in order to recommend items 
[14j . On the contrary, our approach relies only on the be- 
havior and characteristics of large interconnected networks, 
by exploiting relations between their users. Each specific 
user follows her personal behavioral pattern in the usage of 
the Faucet PVR, and we can investigate these patterns to 
compute a similarity among users. 

As a final observation, broadcasting is characterized by the 
exptration of some events: we can suggest the user to record 
only future broadcasts, and even if some shows are serialized, 
the recording of the single episode should be programmed 
in advance. This phenomenon is (partially) due to copy- 
right management, since the content provider are not willing 
to authorize service providers to store previously recorded 
event for further distribution. Nevertheless, recording of 
a broadcast is still allowed, because it is seen as a single 
user activity. As a consequence, we have to deal (also) with 
volatile content, and this differs very much with the VoD 
domain, that has been exhaustively explored in the context 
of recommendation systems. 

3. DATA EXTRACTION 

Due to the specific domain, we are required to perform a 
pre-process of the data obtained from the Faucet PVR prior 
to be able to use such information as input for a recommen- 
dation algorithm. This is needed because the Faucet system 
generates a set of recordings in the continuous domain of 
timings, while a recommender system requires to operate in 
the discrete domain of events. 

As a consequence, the first goal that we have to accomplish 
is the identification of the broadcasted transmissions from 
the amount of unstructured data resulting from the record- 
ing process. This is a multi-step procedure, whose aim is 
to identify a set of discrete elements as the representatives 
of the broadcasted events. Basically, a discrete element is 
obtained as the result of the aggregation of several different 
recordings. A preliminary investigation on the extraction of 
events from recordings is given in 

Let U = {ui,U2, ...,Uk} be the set of distinct users in the 
Faucet platform. Each user in set U has recorded some pro- 
grams in the past and scheduled some for the future. To 
schedule a program, a user must choose a channel c e C, 



representing a list of predefined channels, and a periodicity 
p G {no — repeat, weekly, daily, mon — fri, mon — sat}, rep- 
resenting all the possible periodicities allowed in the PVR 
system. Besides, the user is required to annotate his/her 
recording with a (possibly) meaningful title. 

Let R= {r 1 , r2 , . . . , } be the set of the broadcasted recorded 
programs. Each element in R (a recording) is a tuple =< 
Ui,Ci,Pi,ti,bi, fi > set by a user Ui £ U who recorded on 
the channel Ci £ C with periodicity pi £ P a. program ti- 
tled ti with start time hi and end time fi. Thus, we can 
assume that there exists a function mapping every user to 
her recordings. 

The set R is first processed by means of clustering; then, 
aggregation and collapsing are carried out in sequence on 
the output of the clustering. The three phases are described 
in the following. 

Clustering. Due to the lack of information about the con- 
tent of each recording, they are clustered wrt the channel, 
the periodicity and the difference between timings. Specifi- 
cally, \/ri,rj £ R\cri = Apn ~ Prj we have that 

ri [+J rj iff \bri - br^ | < ^6 A \ fr, - frj \ < Sf, 

where |+J is the clustering function and St,Sf determine the 
maximum clustering distance for the start and end times, re- 
spectively. The identified clusters contain recordings equal 
in the channel and periodicity, and similar on the timing. 
The recording that minimizes the intra-cluster timing dis- 
tances is elected as the centroid of the cluster. At the end 
of the clustering, each cluster identifies an event. 

Aggregation. As the Faucet platform produces new record- 
ings with a hourly frequency, we perform the clustering once 
a hour obtaining a set of newly generated events. A further 
step is then required to possibly aggregate similar events, 
i.e., the new one with those previously created. Such an 
operation is performed as follows: (1) we compare each ele- 
ment generated with the clustering with the existing events 
wrt channel, periodicity and timings; (2) if the timings are 
similar, we correct the properties of existing events with the 
values of the newly created ones. The list of the users asso- 
ciated to the event is updated accordingly. 

Collapsing. Similar discrete elements, i.e. with the same 
channel and periodicity but timings within a fixed range, are 
merged into a single event. All features of the new events 
are computed by means of the values of the collapsed dis- 
crete elements. This operation is required basically because 
events can be created in subsequent moments, by aggregat- 
ing recordings referring to the same broadcasted transmis- 
sions. Due to the high variability of the timings, especially 
when a new transmission appears, such events slowly and 
independently converge to more stable timeframes, deter- 
mining the need of collapsing them into single events. 

As a result of the whole process, we obtain a number of 
events, each being a tuple defined as follows: 

ej =< {ue- },Cj, tlj , bj , fj , pj > 

where: 



• {mbj } is the list of users who set a recording referring 
to that event; 

• Cj is the channel; 

• tlj is the title chosen among those given by users; 

• bj and fj are the the starting and ending times respec- 
tively; 

• Pj G {no — repeat, weekly, daily, mon — fri,mon — sat} 
is the periodicity. 



In Figure [Tl we can observe the behavior of the system in a 
one year timeframe, i.e., from June 2008 to June 2009, wrt 
the number of users, events and recordings. As the number 
of active recordings and events (b) tends to increase over 
time, the number of users follows a different, less constant, 
trend. Specifically, we can notice a considerable increase in 
the number of users in the system between November 2008 
and March 2009. Such a happening implies a consequent 
raise in the number of recordings, due to the augmented 
activity in the system. Analogously, the number of events 
generated by the aggregations of the recordings grows up, 
although less noticeably if compared to the recordings. 
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Figure 1: Number of users, events and recordings 
(a) Total and (b) Active in the considered period 



recommendation algorithm is to suggest items specifically 
tailored to the user's preferences. 

4.1 Algorithms Overview 

Two well-known recommendation techniques are considered 
in this work: (1) the memory based collaborative filtering 
approach named fc-Nearest Neighbors (kNN) [12]; (2) the 
model based approach based on the SVD transform [13| . 
Exploiting the basic idea of the nearest neighbors approach, 
we apply both variants of the kNN algorithm: the user-based 
one 58,, by identifying users interested in similar contents; 
and the item-based approach [7], by focusing on items shared 
by two or more users. In addition, we also analyze the per- 
formance of a variant of the SVD technique based on implicit 
ratings, presented in pjj. 

User-based kNN. In the user-based kNN algorithm, the 
weight of an element for an user Uk can be defined as: 



w(Ufe,ei)= ^ T{Ua,ei) ■ c{uk,Ua) 
«'aSiV{Ufc) 



(1) 



where r{ua,ei) = 



1 if e, e 
if e, ^ 



Eu^ is the set of elements recorded by user Ua, whilst N{uk) 
is the neighborhood of user Uk, limited by considering only 
the top-A^ neighbors ordered by user similarity. The dif- 
ferent similarity functions are discussed in section 14.21 In 
case the number of neighbors is limited by the chosen sim- 
ilarity function to a number lower than k, we also consider 
the 2nd-level neighbors, i.e., for each user Ua belonging to 
N{uk) we compute N{ua). The overall set of Ist-level and 
2nd- level users is then used to define the users similar to Uk , 
as previously described. 

The coefficient c{uk,Ua) represents the neighbor's informa- 
tion weight for user Uk . In most of the kNN-based algorithms 
[S], the coefficient used is the similarity between Uk and Ma- 
in other cases [3] the coefficients are calculated using de- 
rived interpolation weights. It is worth noting that, in case 
of considering 2nd- level neighbors, the coefficient c{uk,Ua) 
in eq. ([l| has to be computed taking into account the sim- 
ilarity between the considered neighbor and further ones. 
For example, considering user Uk, her neighbor Ua and her 
2nd-level neighbor ut, we have: 

c{uk,Ub) = c{uk,Ua) * c{Ua,Ub), 

that is a combination of the similarities computed between 
the neighbors pairs for the considered user. 



4. RECOMMENDATION 

In our context, we can identify two different approaches to 
recommendation, depending on the specific target which is 
considered. As a first attempt, we define a set of events 
which can be of interest to the majority of the users. In such 
a case, we are trying to identify the most frequent events in 
the systems, i.e., those programs which have been recorded 
by the largest subset of users. This is done by means of a 
recommendation algorithm which we name MostPopular. 
A second approach focuses on identifying those events which 
can be of any interest for a single user of the system. In this 
case, which we can refer to as user-oriented, the aim of the 



MostPopular. The MostPopular algorithm can be also de- 
fined by means of eq. ([T]), assuming the number of neigh- 
bors unbounded, which implies N{uk) — U, \luk G U; and 

c{Ua,Ui,) = 1, yUa,Ub G U. 

The weight of an element to an user Uk is therefore defined 
as: 

w(?ffe,ei)= ^ r{ua,ei) (2) 
uaeu 

After calculating the weight of all elements, they are sorted 
in descendant order. In the MostPopular algorithm, as the 
set of neighbors is independent of the user, all users receive 



the same recommended elements, i.e., the most popular el- 
ements. 

Item-based kNN. In the item-based A;NN algorithm, the 
weight of an element for an user Uk is defined as: 

w{uk,ei)= ^ r(iife,ej) ■ c(ei,ej), (3) 

N{ei) is the set of n items most similar to Ci and recorder 
by Uk, and c(ei, Cj) is the neighbor's information weight wrt 
item Bi. 

Differently from the user-based case, using fc = oo in the 
item-based approach does not lead to the Most Popular set 
of elements. In fact, the algorithm simply takes all items 
Cj G Eu^. as neighbors of e;, making N{ei) user-dependent. 

SVD. The Singular Value Decomposition technique ana- 
lyzed in this work makes use of implicit feedbacks and im- 
plements the method proposed in [9j. Specifically, given the 
observations of the behavior of user u wrt item i, rui, we 
can define the user's preference as: 

_ / 1 if r„i > 
" \ if r„. = 

where rui is set to 1 when u records item i, otherwise. 

After associating each user u with a user-factors vector Xu G 
and each item i with an item- factors vector tji £ R-'^, 
we can predict the unobserved value by user u for item i 
through the inner product: x^tji. Factors are computed by 
minimizing the following function [9]: 

miri^(p„, - xlyif -f A I + j 

4.2 Computing Neighborhood 

In order to provide recommendation on the discrete ele- 
ments, we have to define a similarity function for grouping 
similar users/items from which choosing the appropriate el- 
ements to recommend. The definition of the similarity is 
based only on implicit ratings resulting from observing the 
behavior of users: if she records something, then we assume 
that she is interested in it; otherwise, we can not infer any- 
thing about the interest of the user for that element. We 
are therefore considering binary feedbacks. 

User-to-user. Let u and v be two users and Eu,Ev the 
sets of recorded elements associated to them; we can choose 
the similarity metric, S{u, v), considering several well known 
measures, such as: the Jaccard's coefficient, the Dice's co- 
efficient, the Cosine similarity and the Matching similarity 

m- 

Then, \/u, we can then compute the subset A''„ C [/ of neigh- 
bors of user It. A user v such that E^ n Eu 7^ is thus 
defined as a neighbor of u. Starting from the neighborhood 
of u, similarity with u is computed for each pair < u,v > 
such that V € Nu- 

Finally, if S{u, v) > 0, we consider u similar to v, i.e., there 
is an arc connecting them in the similarity network. The 
value S{u, v) is used to weight such a relation, therefore de- 
termining a similarity order among the neighborhood of u. 



Item-to-item. The similarity among items, 5(e, /), is based 
on the same measures already mentioned before, yet rede- 
fined considering two items e, / and their sets of users Ue, Uf 
who recorded them. 

Ve € -E we can compute the subset <Z E oi neighbors of 
item e. An item / such that f/e fl ?7/ 7^ is thus defined 
as a neighbor of e. Starting from the neighborhood of e, 
similarity with e is computed for each pair < e, / > such 
that f £ Ne. 

We can then decide whether a couple of items is similar or 
not. Items e is considered similar to /, i.e., there is an arc 
connecting them in the similarity network, if ^(e, /) > 0. A 
similarity order among the neighbors of e is thus determined. 

5. EXPERIMENTAL RESULTS 

Our evaluation is based on trying to measure how accurate is 
each recommendation algorithm in predicting the elements 
that users would program. This is achieved by computing 
precision and recall on the predicted items. The more accu- 
rate is this prediction, the more valuable elements are rec- 
ommended. It is important to underline that we do not 
consider any feedback related to the user's interest in the 
recommended items, but we only focus on the prediction 
ability of the algorithms analyzed. 

To start evaluating a recommendation algorithm, we fix an 
arbitrary time t after the data collection started and before 
the data collection stopped. The value of t should be care- 
fully chosen not to be too close to the data collection start, 
since we do not have sufficient data to make good predic- 
tions. Also, the time t should not be close to the end of 
data collection, because we need a good amount of data to 
make the verification if the algorithm was able to predict 
it. As the data collection started January 23rd 2008 and 
ended November 19th 2009, we choose values of t varying 
from June 1st 2008 to June 1st 2009. 

5.1 Metrics 

Given the set E of events in our framework, we define the 
following subsets: 

• A{t) C E, active events at time t {bj > t); 

• R{u, t) C E, events recorded by user u before time t; 

• V{u,t) C A{t), events recorded by user u after time t; 

• Rec{u,t) C A{t), events recommended to user u at 
time t. 

It is important to notice that A{t) is also the set of all ele- 
ments suitable for recommendation at time t. The aim of our 
recommendation algorithms is to predict which events are in 
V{u,t). For that, for each user, the algorithms associate a 
weight w{u,s) to each element s € A{t) which represents, 
from the recommender's point of view, how much reliable is 
the fact that s G V{u,t). Furthermore, a recommendation 
algorithm use only the information in 

[J R{u,t), with R{u,t)nV{u,t) = 9. 

ueu 

To recommend items to users, we use only the top n rec- 
ommended elements Rec{n,u,t) C Rec{u,t), i.e., the top n 
elements in Rec{u,t), ordered by weight. The precision and 



recall at time t are computed as the average of all users' 
precision and recall values computed using the top n rec- 
ommended elements [IJ]. Finally, we compute the system 
precision and recall at different times, and calculate the sys- 
tem overall precision and recall as the average of it. 

As in [9] , also in our context precision measures are not very 
meaningful, because we do not have feedbacks regarding the 
user's interest in those items which have not been considered 
(i.e., not programmed, nor downloaded). On the contrary, 
recall-oriented measures are more suitable. Infact, we can 
assume that is of any interest for user it only if Ci £ 
V{u,t), otherwise no assumption on user's interests can be 
made. Anyway, for sake of completeness, we also report the 
analysis of precision values. 

5.2 Evaluation 

As a first step in the evaluation, we attempt to define the 
specific upper and lower bounds which characterize the rec- 
ommendations in the PVR domain. In particular, we com- 
pare the MostPopular recommender, which identifies the 
most frequent elements among all users (Section 14.11) . with 
the following two algorithms: (1) a random recommender, 
which simply chooses n random elements among those of 
A{t), defining the lower bound to our experiment; (2) an ex- 
act predictor recommender, which has knowledge about the 
elements in V{u,t), thus yielding to the best possible results 
and defining the upper bound. 

The results are depicted in Figure [2(a)[ which clearly shows 
that, as expected, even the MostPopular algorithm can eas- 
ily outperform a random predictor. However, it is still far 
from being able to make a complete prediction of all the el- 
ements, especially when the considered top n are just few 
items. 

The second step in our evaluation is to study how different 
similarity functions affect the results of user-based fcNN rec- 
ommendation algorithms. We can observe from Figure 2(b) 



that, in case of the user-based algorithm, all chosen similar- 
ities show nearly the same performances. 
On the contrary, the Matching similarity considerably out- 
performs the other measures when it comes to the item- 
based algorithm, as displayed in Figure 2(c) Again, both 
Dice and Jaccard show a very similar behavior, being supe- 
rior to the Cosine metric already when more than 5 elements 
are recommended. In both Figures |2(b)| and |2(c)[ the Jac- 
card similarity is not shown being almost identical to the 
Dice. 

Another step in our evaluation is to find the consequences of 
adding second-level neighbors in the neighborhood of user- 
based fcNN recommendation algorithms. In Figure 3(a) we 



can observe that increasing the number of first level neigh- 
bors (when it is lower than k) by adding the second level 
ones implies a better performance of the algorithms. In this 
example, we used Dice similarity and k = 300, however the 
results are similar when applying second-level neighbors to 
other similarities. 

In the next tests, we try to find an optimal value for k in the 
user-based fcNN algorithm. Figure [3(b)] shows the results of 
fcNN user-based with k e {100, 300, 500, 700, 2000}, and the 
MostPopular recommender. We used Dice similarity, but 



the results are similar with other similarity functions. In 
addition, in Figure 3(b) as well as in Figure 3(c) we omit 
the values of fc = {500, 700} since the results are very similar 
to the case of fc = 300. 

We can observe that a value fc = 100 is not sufficient to 
outperform the MostPopular algorithm, due to the lower 
value of the recall. On the other side, a very high number of 
neighbors allows to perform better than the MostPopular. 
However, we could notice that, already with k = 2000, the 
algorithm starts to converge to the MostPopular, character- 
ized by an unbounded number of neighbors by definition. 
Therefore, we can consider the range [100, 2000] as suitable 
to identify the optimal value for k. 

For this purpose, we test the values k = {300, 500, 700}, ob- 
taining very similar performance. Considering the top 10 
recommended elements, we can achieve better results for 
k — 300, whilst k = 500 is more suitable when taking the 
top 11 to 30 elements. As in most cases 10 elements are suf- 
ficient for a recommendation, k = 300 is a good compromise 
between the ability of providing valuable recommendations 
and the resource consumption in calculating the neighbor- 
hood. 

To better observe the trend of both recall and precision, 
Figure [3(c) | shows the two values combined. Again, k = 300 
performs better if we take only the top 10 recommended ele- 
ments, as it also yields to good results in terms of precision. 
Considering more than 10 recommendations, it would seem 
appropriate to increase the number of neighbors to 500, as 
the results for precision and recall are slightly better. How- 
ever, the overall behavior of the algorithm is almost identical 
with k in the range {300, 700}. Nevertheless, the above men- 
tioned considerations regarding the superior performance of 
the kNN algorithm with k — 300 in terms of computation 
requirements still apply when we take into account the pre- 
cision metric. 

An interesting comparison among the three kNN algorithms 
analyzed, i.e., user-based, item-based and MostPopular, is 
depicted in Figure |4(a)[ We can observe that the latter is 
clearly outperformed by the other two algorithms in terms of 
recall, especially when more than 7 recommended items are 
considered. Between the item-based and the user-based ver- 
sion of the kNN, the latter performs slightly better, although 
the gap is mostly noticeable when more than 15 items are 
recommended. In general, item-based algorithms tend to 
perform better because usually the number of items is con- 
siderably lower than the users 12 . Such a property does 
not hold in our domain, hence making the user-based ver- 
sion superior in terms of recall, as we initially expected. 

A final experiment is attempted in order to measure the 
behavior of the SVD approach wrt the performance of the 
kNN method. The implementation of the SVD algorithms 
described in Section [4.11 is tested with different parameters, 
with the purpose of identifying the more suitable ones in 
our context. In particular, we try different sizes for user- 
factors and item-factors vectors, values for the A parameter 
and number of training steps. 

Results are depicted in Figure 4(b) The best prediction is 
obtained with 100 features, A — 500, a = 40 and 15 train- 
ing steps. However, the behavior of the latent factor model 
based on SVD in the analyzed context is worse if compared 
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Figure 4: Precision and recall for the analyzed algorithms. 



to a neighborhood model such as kNN. As the reader can 
notice, the kNN user-based is able to substantially outper- 
form the SVD technique, whose results in terms of recall are 
slightly better than those of the MostPopular algorithm only 
when a considerable number of items are recommended. 

Similarly, results related to the precision of the recommenda- 
tions (Figure |4(c)[ ) show an analogous behavior of the kNN 
algorithms wrt SVD, with the Most Popular being consider- 
ably less precise than others. Also, the user-based algorithm 



shows to be more precise than the item-based in determin- 
ing the recommendable items, for the same reason previously 
mentioned considering recall. 

It could appear surprising that the prediction performance 
of the SVD recommender is worse than other techniques, 
as this algorithm normally performs better in several other 
contexts |13U10l l9]. We believe that the motivations for such 
an unusual behavior reside in the dataset characteristics. In 
particular, a reason might be identified in the so called cold 



start problem, whose effects involve users, items and com- 
munities [Is] . 

In our context, the cold start problem is particularly notice- 
able with items and is due to the lack of relevant feedbacks 
when a new event first appears in the system. Such an is- 
sue is made worse by the fact that items to recommend are 
generally new ones, i.e. those events having a starting time 
in the future. This property holds for no-repeat events as 
well as for repetive ones (the starting time is updated ac- 
cording to their periodicity). So, events whose starting time 
has passed are no longer elegible for recommendation. 

The fact that recommendations are affected by the cold start 
problem is one key factor that may influence SVD perfor- 
mance, as this algorithm needs support of user's preferences 
to perform well. On the contrary, a neighborhood-based 
approach such as kNN appears to better deal with newly 
introduced items, as also reported in [5]. 

6. CONCLUSION 

We experimented with a real digital recording service, and, 
accordingly to the above mentioned restrictions, we decided 
to run our analysis under the strongest assumptions: no 
EPGs are available, users can set up timings as well as chan- 
nels, explicit feedbacks are not collected, and so on. In addi- 
tion, the intrinsically dynamic nature of the analyzed PVR 
domain, which determines a continuous process of creation 
and deletion of events and a consequent amplification of the 
cold start problem, makes such a context sensibly different in 
terms of recommendation if compared to those where items 
have no time validity (i.e., netflix, movielens, etc.). 

Despite these constraints, our results showed that neigh- 
borhood based strategies, such as kNN, can return in good 
prediction accuracy and, if correctly tuned, they can outper- 
form SVD-based techniques as well as most popular strate- 
gies, that dangerously leverage the phenomenon of many 
users concentrated on very few relevant events. 
Finally, there is evidence that digital recorders differ from 
other interest based services, because factors other than per- 
sonal tastes might infiuence the user's behavior and the suc- 
cess of a recommendation engine. In fact, the direct social 
influence of friends and the volatile nature of events are sup- 
posed to be relevant factors in causing a user to schedule a 
recording. 

In our opinion, a possibile future research direction could 
be indeed the identification and study of those social factors 
which affect user's behavior in systems characterized by high 
dynamism and short lifetime of items. 
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